Systematic Verb Stem Generation For Arabic
Abstract
Performing root-based searching, concordancing, and grammar checking in Arabic requires an efficient method for matching stems with roots and vice versa. Such mapping is complicated by the hundreds of manifestations of the same root. An algorithm based on the generation method used by native speakers is proposed here to provide a mapping from roots to stems. Verb roots are classified by the types of their radicals and the stems they generate. Roots are moulded with morphosemantic and morphosyntactic patterns to generate stems modified for tense, voice, and mode, and affixed for different subject number, gender, and person. The surface forms of applicable morphophonemic transformations are then derived using finite state machines. This paper defines what is meant by ‘stem’, describes a stem generation engine that the authors developed, and outlines how a generated stem database is compiled for all Arabic verbs.
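The moulding step described above can be illustrated with a minimal sketch. It assumes sound triliteral roots written in a simple Latin transliteration, and its pattern and suffix tables are hypothetical examples, not the tables used by the authors' stem generation engine. Morphophonemic transformations, which the paper derives with finite state machines, are left out entirely.

```python
# Illustrative sketch only: the transliteration, PATTERNS, and SUFFIXES below
# are assumptions for demonstration, not the paper's actual data or engine.

def mould(root, pattern):
    """Fill the slots '1', '2', '3' in `pattern` with the radicals of `root`."""
    if len(root) != 3:
        raise ValueError("this sketch handles triliteral roots only")
    slots = {"1": root[0], "2": root[1], "3": root[2]}
    return "".join(slots.get(ch, ch) for ch in pattern)

# Hypothetical patterns for sound Form I verbs, keyed by (tense, voice).
PATTERNS = {
    ("perfect", "active"): "1a2a3",
    ("perfect", "passive"): "1u2i3",
}

# Hypothetical perfect-tense subject suffixes, keyed by (person, number, gender).
SUFFIXES = {
    ("3", "sg", "m"): "a",
    ("3", "sg", "f"): "at",
    ("1", "sg", None): "tu",
}

def generate(root, tense, voice, subject):
    """Mould the root with a pattern, then attach the subject suffix."""
    stem = mould(root, PATTERNS[(tense, voice)])
    return stem + SUFFIXES[subject]

print(generate("ktb", "perfect", "active", ("3", "sg", "m")))   # kataba
print(generate("ktb", "perfect", "passive", ("3", "sg", "f")))  # kutibat
```

Keeping the pattern choice (tense, voice) separate from the suffix choice (person, number, gender) mirrors the two stages named in the abstract. Weak, hollow, or doubled roots would need a further transformation stage that this sketch does not attempt.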
Similar resources
Borrowing the Verb “ast” and Its Varieties in Arabic Dialect of Sarab
“Borrowing” is a linguistic process studied in diachronic linguistics, in which one language borrows elements from another. It usually occurs in areas where two languages come into contact. Such borrowing takes place in a dialect spoken in South Khorasan. The Arabs living in this part of Iran probably immigrated in the early centuries of Islam. In thi...
Tracking Morphophonemic Transformation in Arabic Word Generation and Root Extraction
Performing root-based searching, concordancing, and grammar checking in Arabic requires an efficient method for matching stems with roots and vice versa. Such mapping is complicated by the hundreds of manifestations of the same root; the radicals often undergo replacement, fusion, inversion, and/or deletion. It is a challenge, therefore, to keep track of original radicals. An algorithm based on...
Arabic Morphology Generation Using a Concatenative Strategy
Arabic inflectional morphology requires infixation, prefixation and suffixation, giving rise to a large space of morphological variation. In this paper we describe an approach to reducing the complexity of Arabic morphology generation using discrimination trees and transformational rules. By decoupling the problem of stem changes from that of prefixes and suffixes, we gain a significant reducti...
Constructing An Automatic Lexicon for Arabic Language
In this paper, we have designed and implemented a system for building an Automatic Lexicon for the Arabic language. Our Arabic Lexicon contains word-specific information. This information includes: morphological information such as the root (stem) of the word, its pattern, and its affixes; the part-of-speech tag of the word, which classifies it as a noun, verb, or particle; lexical attr...
Classifying Arabic Verbs Using Sibling Classes
In the effort to build a verb lexicon classifying the most used verbs in Arabic and providing information about their syntax and semantics (Mousser, 2010), the problem of class over-generation arises because of the overt morphology of Arabic, which codes not only agreement and inflection relations but also semantic information related to thematic arity or other semantic information like ”i...
Publication date: 2004